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ABSTRACT 


Let  {Y • • • ,X^)  be  a random  vector  distributed  according 
to  a multivariate  normal  distribution,  where  X^, . . , are 
considered  as  predictor  variediles  and  Y is  the  predicteuid. 

Let  r^  emd  R^^  denote  the  population  and  sample  correlation  coef- 
ficients, respectively,  between  Y and  X^.  The  population  corre- 
lation coefficient  r^^  is  a measure  of  the  predictive  power  of 
The  author  has  derived  the  joint  distribution  of  R^,...,Rj^ 
and  its  asymptotic  property.  The  given  result  is  useful  in  the 
problem  of  selecting  the  most  important  predictor  varieUsle,  cor- 
responding to  the  largest  absolute  value  of  r^^. 
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1.  INTROUCTION 
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The  problem  of  selecting  a variable  or  several  variables 
from  a set  of  predictor  variables  {X^}  occurs  frequently  in 
the  design  of  experiments.  The  correlation  between  a predictor 
variable  and  the  predictand  Y measures  the  "leverage”  of  X^ 
upon  Y.  If  Xj^  and  Y are  jointly  distributed  according  to  the 
standard  bivariate  normal  distribution  with  correlation  coeffi- 
cient r^  then  the  conditional  distribution  of  Y given  X^  is  nor- 
2 

mad  N(rj^Xj^,l-r^) . The  larger  the  absolute  value  of  r^^,  the 

smaller  is  the  variance  of  the  conditional  distribution,  and 

therefore  higher  is  the  predictive  power  of  X^.  Thus  the  pre- 

2 

dictor  variable  corresponding  to  the  largest  value  of  r^  may 

be  considered  as  the  most  important  (best)  predictor  variable. 

Let  the  random  vector  (Y,X^,..., 

Ing  to  a multivariate  normal  distribution.  Suppose  that  a 

sample  of  n observations  is  taken  from  the  given  distribution. 

Let  r.  and  denote  the  population  and  sample  correlation 

* 2 -ik 

coefficients  between  Y and  X^^,  respectively.  Let  r^  ■ r^(l-rj^)  ^ 
* 2 -k 

zmd  “ R^(l-R^)  In  this  paper  we  derive  the  asymptotic 

* * * 

distribution  of  R « (R^,...,R^).  The  given  result  is  useful 
in  the  problem  of  selecting  the  best  predictor  variable.  The 
selection  problem  has  been  considered  recently  by  Ramberg  (1976) . 
Rizvl  emd  Solomon  (1976)  and  Alam,  Rizvi  and  Solomon  (1976)  have 
considered  the  « «m  of  selecting  from  p ^ 2 given  multivariate 
populations,  the  f .lation  with  the  largest  multiple  correlation 
between  a single  variate,  classified  as  the  predictand,  and  the 
remaining  variates. 


X^)  be  distributed  accord- 
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2.  ASYMPTOTIC  DISTRIBUTION  OP  R* 

mg 

First  we  prove  a lemma  which  will  be  used  in  the  proof 
of  the  main  result.  Let  ' * * * ' ^Kt^  denote  the  t~th 
observation  in  a sample  of  n observations  from  a K-variate 
normal  distribution  N(0,n),  and  let  U^  ■ ^t«l  ^it 

I 

0 ■ ^^1' . . • • Let  (2  ■ ...  ' and 

Q “ (u^j ) . 

Lemma  2.1.  The  asymptotic  distribution  of  U is  multi- 

A 

variate  normal  N(ne,2nn). 

Proof:  Let  be  K imaginairy  numbers  and  let  D 

be  a diagonal  matrix  whose  ith  diagonal  element  is  6^^.  Let 

k k k 

denote  the  characteristic  roots  of  where 

denotes  a symmetric  square  root  of  Q.  He  have 

■ trace 

« trace  Dil 

» 0'e 
•*0  ^ 

X?  « trace 

* trace  (Dft) ^ 

- e'fle 

where  9 ■ (9^^, . . . , 0^^) ' . 

The  characteristic  function  of  the  distribution  of  U is 

e 

E exp(0'U)  - |l-2£J^Dn^r"/^ 


given  by 


where  I denotes  an  Identity  matrix.  The  characteristic  func- 
tion of  the  normalized  distribution  of  U is  given  by 


E exp(-i  e’ (U-ne) ) - |l- /|fl^Dn^r"/^exp(- /?0'e) 
- - - n n-  . 


exp(j  e'Se) (1+0 (n”^)) 


(2.1) 


The  lemma  follows  since  the  limiting  value  of  the  right  hand 
side  of  (2.1)  as  n-*«e  is  equal  to  the  characteristic  function 
of  the  multivariate  normal  distribution  N(0#0).Q 

Now  we  consider  the  distribution  of  R*.  Without  loss  of 
generality  we  can  assvime  that  the  variables  Y,  are 

standarlzed,  that  is,  they  are  distributed  with  mean  0 and 
variemce  1.  Let  r^^  denote  the  correlation  coefficient  between 

and  Xj  and  let  I ■ I “ ^^t'^lt' ’ * * '*Kt^ 

denote  the  t-th  observation  in  the  sample,  and  let 
_ i V Inn  » nn  2 


^i  - i ^t-i^it'  ^ - St-i^t' 


Vi  - 


(2.2) 


"i  - - ''i 


(2.3) 


From  the  theory  of  linear  regression  analysis  it  is  seen  that 
d 2 2 

(l-r^)Xjj_2»  chi-square  with  n-2  degrees  of  freedom,  inde- 
pendent of  and  Y ■ (Y,  ,...,Y^)'  and  V^?N(r^S,l-r^)  auid 


COV conditionally  given  Y.  Let 
and  j ) • It  is  also  seen  that  can  be  represented  as 

the  sum  of  squares  of  (n-2)  orthogonal  linear  functions  of 
t m • m f Xj^^  • That  is 


rn”2 

ifl 


(2.4) 


Where  Z^«  ^^It' * * * ' *kt^ ' identically  auid  Independently 

distributed  as  N(0,n),  independent  of  V. and  S. 

^ X K 

Let  T a (T^, . . . ' be  a random  vector  distributed  as 

N(Orn)»  independent  of  S and  W ■ (W, , . . . ,W_) ' . Then 

RJ  - V^(W^)“^ 

d (T^+r^S)W^“^*.  (2.5) 

Therefore 

Theorem  2.1.  The  joint  distribution  of  the  saunple  corre- 
lation coefficients  between  the  predictand  and  the  predictor 
variables  of  a multivariate  normal  distribution  is  given  by 
(2.5),  where  TfN(0,ft),S^  s the  distribution  of  W is  given 

by  (2.4).  Moreover,  (S,T,W)  are  jointly  independent. 

•»0 

For  large  n,  W is  asymptotically  distributed  as  N((n-2)f, 

2(n-2)n)  by  Lemma  2.1,  where  f * (1-r^, . . . ,1-r^) ' and 
2 

n ■ Therefore 

Corollary  2.1.  The  asymptotic  distribution  of  R*  is  given 

by  (2.5),  where  T ? Nro,a),  i » W ? N ( (n-2)  f ,2  (n-2)S)  and 

- n— 1 

(S,T,W)  are  jointly  independent. 

Let  a ^^ij“^i^j  ^ ^ ^(l-r^)”^  and  r a (Vj^j)* 

I 

I 
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From  (2.5)  we  have  for  large  n 


*^(Rj-r*)  » ^ rj(( 


(l-r2)s2 


w. 


-) 


1)  0„(n"^) 

P 


-k 

- T . + — (A-B . ) + (n  ’) 
i ^ i P 


(2.6) 


Where  (0,1),  B-  (B, , . . . ,B_) '?  N(0,r).  Moreover,  (T,A,B) 

^ X Jv  ^ ^ 

are  jointly  Independent.  Therefore  •^(R*-r*)  is  asymptotically 
distributed  as  N(0,ll+— |-(r+E) ) where  E » ®ij  * 


Let 


a + »^(r+E) . 


The  elements  of  C are  given  by 


'ii 


1-r?  + r*^ 


(r. . "r^r , ) 

^ij-Vj  ^ <2.7) 


^ij  ■ *ij  *i‘j 


(l-rp  (1-rp 


Therefore 

Corollary  2.2.  For  large  n,  ✓5r(R*-r*)  is  asymptotically 
distributed  as  N(0,C),  where  C is  given  by  (2.7). 

It  is  interesting  to  consider  the  following  special  cases: 
(1)  “ 0,  r^^  *0,  i 7^  j for  all  i and  j,  that  is,  the 

variables  Y,  X^,...,Xj^  are  jointly  Independent.  We  have  C « I 
and  »^(R*-r*)^ (0,1) , asymptotically.  (2)  r^  ■ 0,  r^^  ■ p, 
i j for  all  i and  j,  that  is  the  predictor  variables  X^,...,X^ 
are  equi-correlated  and  independent  of  Y.  We  have  * 1, 

Cij  - P,  i j.  (3)  r^  - D,  r^j  - 0,  i ^ j for  all  1 and  j , 
that  is  the  predictor  variables  are  jointly  independent  and 
equi-correlated  with  Y.  We  have 
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Now  we  consider  the  problem  of  selecting  the  best  predic- 
tor variable.  A standard  procedure  is  to  select  the  variable 

from  the  predictor  variables  corresponding  to  the  largest  value 

2 

of  the  squared  correlation  coefficients 
2 2 

R*  . By  Corollary  2.2  the  prob2d3ility  of  a correct 

selection  can  be  derived  for  large  n from  the  multivariate  nor- 
mal distribution  function.  In  the  special  case  (1)  we  have  that 
n mauc  / • . • » 

2 

statistic  in  a sample  of  K observations  from  X]^~ch^~square  with 
1 degree  of  freedom.  This  result  can  be  used  also  to  test  the 
significance  of  the  correlation  between  the  selected  predictor 
vari2d}le  and  the  predictand.  Similar  results  are  obtained  for 
the  cases  (2)  and  (3). 


2 

R*  ) is  distributed  as  the  largest  order 


Rj^  or  equivalently 
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